De-identification of primary care electronic medical records free-text data in Ontario, Canada
Internal identifier: 000607 (Main/Exploration); previous: 000606; next: 000608
Authors: Karen Tu [Canada]; Julie Klein-Geltink [Canada]; Tezeta F. Mitiku [Canada]; Chiriac Mihai [Canada]; Joel Martin [Canada]
Source:
- BMC Medical Informatics and Decision Making [1472-6947]; 2010.
Abstract
Electronic medical records (EMRs) represent a potentially rich source of health information for research, but the free-text in EMRs often contains identifying information. While de-identification tools have been developed for free-text, none have been developed or tested for the full range of primary care EMR data.
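We used deid open source de-identification software and modified it for an Ontario context for use on primary care EMR data. We developed the modified program on a training set of 1000 free-text records from one group practice and then tested it on two validation sets: a random sample of 700 free-text EMR records from 17 physicians in 7 practices in 5 cities, and 500 free-text records from a group practice in a different city from the one used for the training set. We measured the sensitivity/recall, precision, specificity, accuracy and F-measure of the modified tool against manually tagged free-text records in removing patient and physician names, locations, addresses, and medical record, health card and telephone numbers.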
On the training set, the modified program achieved a sensitivity of 88.3%, specificity of 91.4%, precision of 91.3%, accuracy of 89.9% and F-measure of 0.90. The validation sets had sensitivities of 86.7% and 80.2%, specificities of 91.4% and 87.7%, precisions of 91.1% and 87.4%, accuracies of 89.0% and 83.8% and F-measures of 0.89 and 0.84 for the first and second validation sets respectively.
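The deid program can be modified to de-identify free-text primary care EMR records with reasonable accuracy while preserving clinical content.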
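The reported F-measures can be cross-checked against the reported precision and sensitivity/recall figures. The sketch below assumes the abstract's F-measure is the balanced F1 score (the harmonic mean of precision and recall); the abstract does not state which variant was used, and the function name `f_measure` is only illustrative.

```python
def f_measure(precision: float, recall: float) -> float:
    """Balanced F-measure (F1), assumed here: harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, sensitivity/recall) pairs as reported in the abstract.
reported = {
    "training set": (0.913, 0.883),
    "validation set 1": (0.911, 0.867),
    "validation set 2": (0.874, 0.802),
}

for name, (precision, recall) in reported.items():
    print(f"{name}: F = {f_measure(precision, recall):.2f}")
```

Under that assumption the computed values round to 0.90, 0.89 and 0.84, which agrees with the figures in the abstract.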
Url: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2907300
DOI: 10.1186/1472-6947-10-35
PubMed: 20565894
PubMed Central: 2907300
Links to previous steps (curation, corpus...)
- to stream Pmc, to step Corpus: 000151
- to stream Pmc, to step Curation: 000151
- to stream Pmc, to step Checkpoint: 000149
- to stream Ncbi, to step Merge: 000082
- to stream Ncbi, to step Curation: 000082
- to stream Ncbi, to step Checkpoint: 000082
- to stream Main, to step Merge: 000612
- to stream Main, to step Curation: 000607
The document in XML format
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">De-identification of primary care electronic medical records free-text data in Ontario, Canada</title>
<author><name sortKey="Tu, Karen" sort="Tu, Karen" uniqKey="Tu K" first="Karen" last="Tu">Karen Tu</name>
<affiliation wicri:level="1"><nlm:aff id="I1">Institute for Clinical Evaluative Sciences (ICES) G106, 2075 Bayview Avenue, Toronto, Ontario, M4N 3M5, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Institute for Clinical Evaluative Sciences (ICES) G106, 2075 Bayview Avenue, Toronto, Ontario, M4N 3M5</wicri:regionArea>
<wicri:noRegion>M4N 3M5</wicri:noRegion>
</affiliation>
<affiliation wicri:level="4"><nlm:aff id="I2">Department of Family and Community Medicine-University of Toronto, 263 McCaul Street, 5th Floor Toronto, Ontario, M5T 1W7, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Department of Family and Community Medicine-University of Toronto, 263 McCaul Street, 5th Floor Toronto, Ontario, M5T 1W7</wicri:regionArea>
<orgName type="university">Université de Toronto</orgName>
<placeName><settlement type="city">Toronto</settlement>
<region type="state">Ontario</region>
</placeName>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="I3">Toronto Western Hospital Family Health Team-University Health Network, 399 Bathurst Street, Toronto, Ontario, M5T 2S8, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Toronto Western Hospital Family Health Team-University Health Network, 399 Bathurst Street, Toronto, Ontario, M5T 2S8</wicri:regionArea>
<wicri:noRegion>M5T 2S8</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Klein Geltink, Julie" sort="Klein Geltink, Julie" uniqKey="Klein Geltink J" first="Julie" last="Klein-Geltink">Julie Klein-Geltink</name>
<affiliation wicri:level="1"><nlm:aff id="I1">Institute for Clinical Evaluative Sciences (ICES) G106, 2075 Bayview Avenue, Toronto, Ontario, M4N 3M5, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Institute for Clinical Evaluative Sciences (ICES) G106, 2075 Bayview Avenue, Toronto, Ontario, M4N 3M5</wicri:regionArea>
<wicri:noRegion>M4N 3M5</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Mitiku, Tezeta F" sort="Mitiku, Tezeta F" uniqKey="Mitiku T" first="Tezeta F" last="Mitiku">Tezeta F. Mitiku</name>
<affiliation wicri:level="1"><nlm:aff id="I1">Institute for Clinical Evaluative Sciences (ICES) G106, 2075 Bayview Avenue, Toronto, Ontario, M4N 3M5, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Institute for Clinical Evaluative Sciences (ICES) G106, 2075 Bayview Avenue, Toronto, Ontario, M4N 3M5</wicri:regionArea>
<wicri:noRegion>M4N 3M5</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Mihai, Chiriac" sort="Mihai, Chiriac" uniqKey="Mihai C" first="Chiriac" last="Mihai">Chiriac Mihai</name>
<affiliation wicri:level="1"><nlm:aff id="I1">Institute for Clinical Evaluative Sciences (ICES) G106, 2075 Bayview Avenue, Toronto, Ontario, M4N 3M5, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Institute for Clinical Evaluative Sciences (ICES) G106, 2075 Bayview Avenue, Toronto, Ontario, M4N 3M5</wicri:regionArea>
<wicri:noRegion>M4N 3M5</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Martin, Joel" sort="Martin, Joel" uniqKey="Martin J" first="Joel" last="Martin">Joel Martin</name>
<affiliation wicri:level="1"><nlm:aff id="I4">Institute for Information Technology, National Research Council, 1200 Montreal Road, Ottawa, Ontario, K1A 0R6, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Institute for Information Technology, National Research Council, 1200 Montreal Road, Ottawa, Ontario, K1A 0R6</wicri:regionArea>
<wicri:noRegion>K1A 0R6</wicri:noRegion>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PMC</idno>
<idno type="pmid">20565894</idno>
<idno type="pmc">2907300</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2907300</idno>
<idno type="RBID">PMC:2907300</idno>
<idno type="doi">10.1186/1472-6947-10-35</idno>
<date when="2010">2010</date>
<idno type="wicri:Area/Pmc/Corpus">000151</idno>
<idno type="wicri:Area/Pmc/Curation">000151</idno>
<idno type="wicri:Area/Pmc/Checkpoint">000149</idno>
<idno type="wicri:Area/Ncbi/Merge">000082</idno>
<idno type="wicri:Area/Ncbi/Curation">000082</idno>
<idno type="wicri:Area/Ncbi/Checkpoint">000082</idno>
<idno type="wicri:Area/Main/Merge">000612</idno>
<idno type="wicri:Area/Main/Curation">000607</idno>
<idno type="wicri:Area/Main/Exploration">000607</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a" type="main">De-identification of primary care electronic medical records free-text data in Ontario, Canada</title>
<author><name sortKey="Tu, Karen" sort="Tu, Karen" uniqKey="Tu K" first="Karen" last="Tu">Karen Tu</name>
<affiliation wicri:level="1"><nlm:aff id="I1">Institute for Clinical Evaluative Sciences (ICES) G106, 2075 Bayview Avenue, Toronto, Ontario, M4N 3M5, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Institute for Clinical Evaluative Sciences (ICES) G106, 2075 Bayview Avenue, Toronto, Ontario, M4N 3M5</wicri:regionArea>
<wicri:noRegion>M4N 3M5</wicri:noRegion>
</affiliation>
<affiliation wicri:level="4"><nlm:aff id="I2">Department of Family and Community Medicine-University of Toronto, 263 McCaul Street, 5th Floor Toronto, Ontario, M5T 1W7, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Department of Family and Community Medicine-University of Toronto, 263 McCaul Street, 5th Floor Toronto, Ontario, M5T 1W7</wicri:regionArea>
<orgName type="university">Université de Toronto</orgName>
<placeName><settlement type="city">Toronto</settlement>
<region type="state">Ontario</region>
</placeName>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="I3">Toronto Western Hospital Family Health Team-University Health Network, 399 Bathurst Street, Toronto, Ontario, M5T 2S8, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Toronto Western Hospital Family Health Team-University Health Network, 399 Bathurst Street, Toronto, Ontario, M5T 2S8</wicri:regionArea>
<wicri:noRegion>M5T 2S8</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Klein Geltink, Julie" sort="Klein Geltink, Julie" uniqKey="Klein Geltink J" first="Julie" last="Klein-Geltink">Julie Klein-Geltink</name>
<affiliation wicri:level="1"><nlm:aff id="I1">Institute for Clinical Evaluative Sciences (ICES) G106, 2075 Bayview Avenue, Toronto, Ontario, M4N 3M5, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Institute for Clinical Evaluative Sciences (ICES) G106, 2075 Bayview Avenue, Toronto, Ontario, M4N 3M5</wicri:regionArea>
<wicri:noRegion>M4N 3M5</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Mitiku, Tezeta F" sort="Mitiku, Tezeta F" uniqKey="Mitiku T" first="Tezeta F" last="Mitiku">Tezeta F. Mitiku</name>
<affiliation wicri:level="1"><nlm:aff id="I1">Institute for Clinical Evaluative Sciences (ICES) G106, 2075 Bayview Avenue, Toronto, Ontario, M4N 3M5, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Institute for Clinical Evaluative Sciences (ICES) G106, 2075 Bayview Avenue, Toronto, Ontario, M4N 3M5</wicri:regionArea>
<wicri:noRegion>M4N 3M5</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Mihai, Chiriac" sort="Mihai, Chiriac" uniqKey="Mihai C" first="Chiriac" last="Mihai">Chiriac Mihai</name>
<affiliation wicri:level="1"><nlm:aff id="I1">Institute for Clinical Evaluative Sciences (ICES) G106, 2075 Bayview Avenue, Toronto, Ontario, M4N 3M5, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Institute for Clinical Evaluative Sciences (ICES) G106, 2075 Bayview Avenue, Toronto, Ontario, M4N 3M5</wicri:regionArea>
<wicri:noRegion>M4N 3M5</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Martin, Joel" sort="Martin, Joel" uniqKey="Martin J" first="Joel" last="Martin">Joel Martin</name>
<affiliation wicri:level="1"><nlm:aff id="I4">Institute for Information Technology, National Research Council, 1200 Montreal Road, Ottawa, Ontario, K1A 0R6, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Institute for Information Technology, National Research Council, 1200 Montreal Road, Ottawa, Ontario, K1A 0R6</wicri:regionArea>
<wicri:noRegion>K1A 0R6</wicri:noRegion>
</affiliation>
</author>
</analytic>
<series><title level="j">BMC Medical Informatics and Decision Making</title>
<idno type="eISSN">1472-6947</idno>
<imprint><date when="2010">2010</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass></textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en"><sec><title>Background</title>
<p>Electronic medical records (EMRs) represent a potentially rich source of health information for research, but the free-text in EMRs often contains identifying information. While de-identification tools have been developed for free-text, none have been developed or tested for the full range of primary care EMR data.</p>
</sec>
<sec><title>Methods</title>
<p>We used <italic>deid</italic> open source de-identification software and modified it for an Ontario context for use on primary care EMR data. We developed the modified program on a training set of 1000 free-text records from one group practice and then tested it on two validation sets: a random sample of 700 free-text EMR records from 17 physicians in 7 practices in 5 cities, and 500 free-text records from a group practice in a different city from the one used for the training set. We measured the sensitivity/recall, precision, specificity, accuracy and F-measure of the modified tool against manually tagged free-text records in removing patient and physician names, locations, addresses, and medical record, health card and telephone numbers.</p>
</sec>
<sec><title>Results</title>
<p>On the training set, the modified program achieved a sensitivity of 88.3%, specificity of 91.4%, precision of 91.3%, accuracy of 89.9% and F-measure of 0.90. The validation sets had sensitivities of 86.7% and 80.2%, specificities of 91.4% and 87.7%, precisions of 91.1% and 87.4%, accuracies of 89.0% and 83.8% and F-measures of 0.89 and 0.84 for the first and second validation sets respectively.</p>
</sec>
<sec><title>Conclusion</title>
<p>The <italic>deid</italic> program can be modified to de-identify free-text primary care EMR records with reasonable accuracy while preserving clinical content.</p>
</sec>
</div>
</front>
<back><div1 type="bibliography"><listBibl><biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct><analytic><author><name sortKey="Mitiku, T" uniqKey="Mitiku T">T Mitiku</name>
</author>
<author><name sortKey="Tu, K" uniqKey="Tu K">K Tu</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct><analytic><author><name sortKey="Taira, Rk" uniqKey="Taira R">RK Taira</name>
</author>
<author><name sortKey="Bui, Aa" uniqKey="Bui A">AA Bui</name>
</author>
<author><name sortKey="Kangarloo, H" uniqKey="Kangarloo H">H Kangarloo</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Sweeney, L" uniqKey="Sweeney L">L Sweeney</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Berman, Jj" uniqKey="Berman J">JJ Berman</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Gupta, D" uniqKey="Gupta D">D Gupta</name>
</author>
<author><name sortKey="Saul, M" uniqKey="Saul M">M Saul</name>
</author>
<author><name sortKey="Gilbertson, J" uniqKey="Gilbertson J">J Gilbertson</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Beckwith, Ba" uniqKey="Beckwith B">BA Beckwith</name>
</author>
<author><name sortKey="Mahaadevan, R" uniqKey="Mahaadevan R">R Mahaadevan</name>
</author>
<author><name sortKey="Balis, Uj" uniqKey="Balis U">UJ Balis</name>
</author>
<author><name sortKey="Kuo, F" uniqKey="Kuo F">F Kuo</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Uzuner, O" uniqKey="Uzuner O">O Uzuner</name>
</author>
<author><name sortKey="Sibanda, T" uniqKey="Sibanda T">T Sibanda</name>
</author>
<author><name sortKey="Luo, Y" uniqKey="Luo Y">Y Luo</name>
</author>
<author><name sortKey="Szolovits" uniqKey="Szolovits">Szolovits</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct><analytic><author><name sortKey="Szarvas, G" uniqKey="Szarvas G">G Szarvas</name>
</author>
<author><name sortKey="Farkas, R" uniqKey="Farkas R">R Farkas</name>
</author>
<author><name sortKey="Busa Fekete, R" uniqKey="Busa Fekete R">R Busa-Fekete</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Uzuner, O" uniqKey="Uzuner O">O Uzuner</name>
</author>
<author><name sortKey="Luo, Y" uniqKey="Luo Y">Y Luo</name>
</author>
<author><name sortKey="Szolovits, P" uniqKey="Szolovits P">P Szolovits</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Wellner, B" uniqKey="Wellner B">B Wellner</name>
</author>
<author><name sortKey="Huyck, M" uniqKey="Huyck M">M Huyck</name>
</author>
<author><name sortKey="Mardis, S" uniqKey="Mardis S">S Mardis</name>
</author>
<author><name sortKey="Aberdeen, J" uniqKey="Aberdeen J">J Aberdeen</name>
</author>
<author><name sortKey="Morgan, A" uniqKey="Morgan A">A Morgan</name>
</author>
<author><name sortKey="Peshkin, L" uniqKey="Peshkin L">L Peshkin</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Thomas, Sm" uniqKey="Thomas S">SM Thomas</name>
</author>
<author><name sortKey="Mamlin, B" uniqKey="Mamlin B">B Mamlin</name>
</author>
<author><name sortKey="Schadow, G" uniqKey="Schadow G">G Schadow</name>
</author>
<author><name sortKey="Mcdonald, C" uniqKey="Mcdonald C">C McDonald</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Neamatullah, I" uniqKey="Neamatullah I">I Neamatullah</name>
</author>
<author><name sortKey="Douglass, Mm" uniqKey="Douglass M">MM Douglass</name>
</author>
<author><name sortKey="Lehman, Lh" uniqKey="Lehman L">LH Lehman</name>
</author>
<author><name sortKey="Reisner, A" uniqKey="Reisner A">A Reisner</name>
</author>
<author><name sortKey="Viallarroel, M" uniqKey="Viallarroel M">M Viallarroel</name>
</author>
<author><name sortKey="Long, Wj" uniqKey="Long W">WJ Long</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct><analytic><author><name sortKey="Sokolova, M" uniqKey="Sokolova M">M Sokolova</name>
</author>
<author><name sortKey="El Emam, K" uniqKey="El Emam K">K El Emam</name>
</author>
<author><name sortKey="Chowdhury, S" uniqKey="Chowdhury S">S Chowdhury</name>
</author>
<author><name sortKey="Emilio, N" uniqKey="Emilio N">N Emilio</name>
</author>
<author><name sortKey="Rose, S" uniqKey="Rose S">S Rose</name>
</author>
<author><name sortKey="Jonker, E" uniqKey="Jonker E">E Jonker</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct><analytic><author><name sortKey="Velupillai, S" uniqKey="Velupillai S">S Velupillai</name>
</author>
<author><name sortKey="Dalianis, H" uniqKey="Dalianis H">H Dalianis</name>
</author>
<author><name sortKey="Hassel, M" uniqKey="Hassel M">M Hassel</name>
</author>
<author><name sortKey="Nilsson, Gh" uniqKey="Nilsson G">GH Nilsson</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Grouin, C" uniqKey="Grouin C">C Grouin</name>
</author>
<author><name sortKey="Rosier, A" uniqKey="Rosier A">A Rosier</name>
</author>
<author><name sortKey="Dameron, O" uniqKey="Dameron O">O Dameron</name>
</author>
<author><name sortKey="Zweigenbaum, P" uniqKey="Zweigenbaum P">P Zweigenbaum</name>
</author>
</analytic>
</biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<affiliations><list><country><li>Canada</li>
</country>
<region><li>Ontario</li>
</region>
<settlement><li>Toronto</li>
</settlement>
<orgName><li>Université de Toronto</li>
</orgName>
</list>
<tree><country name="Canada"><noRegion><name sortKey="Tu, Karen" sort="Tu, Karen" uniqKey="Tu K" first="Karen" last="Tu">Karen Tu</name>
</noRegion>
<name sortKey="Klein Geltink, Julie" sort="Klein Geltink, Julie" uniqKey="Klein Geltink J" first="Julie" last="Klein-Geltink">Julie Klein-Geltink</name>
<name sortKey="Martin, Joel" sort="Martin, Joel" uniqKey="Martin J" first="Joel" last="Martin">Joel Martin</name>
<name sortKey="Mihai, Chiriac" sort="Mihai, Chiriac" uniqKey="Mihai C" first="Chiriac" last="Mihai">Chiriac Mihai</name>
<name sortKey="Mitiku, Tezeta F" sort="Mitiku, Tezeta F" uniqKey="Mitiku T" first="Tezeta F" last="Mitiku">Tezeta F. Mitiku</name>
<name sortKey="Tu, Karen" sort="Tu, Karen" uniqKey="Tu K" first="Karen" last="Tu">Karen Tu</name>
<name sortKey="Tu, Karen" sort="Tu, Karen" uniqKey="Tu K" first="Karen" last="Tu">Karen Tu</name>
</country>
</tree>
</affiliations>
</record>
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000607 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000607 | SxmlIndent | more
To link to this page within the Wicri network
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Exploration |type= RBID |clé= PMC:2907300 |texte= De-identification of primary care electronic medical records free-text data in Ontario, Canada }}
To generate wiki pages
HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Exploration/RBID.i -Sk "pubmed:20565894" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd \
       | NlmPubMed2Wicri -a OcrV1
This area was generated with Dilib version V0.6.32.